AITopics | strongest attack

Collaborating Authors

strongest attack

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

3b3fff6463464959dcd1b68d0320f781-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 12:41:57 GMT

artificial intelligence, batch size, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)
Law (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)

Add feedback

Adversarial Robustness through Local Linearization

Chongli Qin, James Martens, Sven Gowal, Dilip Krishnan, Krishnamurthy Dvijotham, Alhussein Fawzi, Soham De, Robert Stanforth, Pushmeet Kohli

Neural Information Processing SystemsFeb-11-2026, 11:07:47 GMT

Adversarial training is an effective methodology to train deep neural networks which arerobustagainstadversarial, norm-bounded perturbations. However,the computational cost of adversarial training grows prohibitively as the size of the model and number of input dimensions increase.

artificial intelligence, deep learning, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

Add feedback

4: fort {1,2,,E}do 5: StepP(t) P(t 1)+αt P(t 1) h ℓattack f A+P(t 1), X; θ =train(A+P(t 1),X,y),y i

Neural Information Processing SystemsFeb-8-2026, 10:06:09 GMT

As introduced in 2, attacks may perturb the adjacency matrixA, the feature matrixX, or both.

artificial intelligence, citeseer, machine learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.34)

Add feedback

3b3fff6463464959dcd1b68d0320f781-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 06:56:36 GMT

batch size, gradient, strongest attack, (14 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)
Law (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)

Add feedback

A test suite of prompt injection attacks for LLM-based machine translation

Miceli-Barone, Antonio Valerio, Sun, Zhifan

arXiv.org Artificial IntelligenceOct-7-2024

LLM-based NLP systems typically work by embedding their input data into prompt templates which contain instructions and/or in-context examples, creating queries which are submitted to a LLM, and then parsing the LLM response in order to generate the system outputs. Prompt Injection Attacks (PIAs) are a type of subversion of these systems where a malicious user crafts special inputs which interfere with the prompt templates, causing the LLM to respond in ways unintended by the system designer. Recently, Sun and Miceli-Barone proposed a class of PIAs against LLM-based machine translation. Specifically, the task is to translate questions from the TruthfulQA test suite, where an adversarial prompt is prepended to the questions, instructing the system to ignore the translation instruction and answer the questions instead. In this test suite, we extend this approach to all the language pairs of the WMT 2024 General Machine Translation task. Moreover, we include additional attack formats in addition to the one originally studied.

large language model, machine learning, qm bw cw lid transl, (18 more...)

arXiv.org Artificial Intelligence

2410.05047

Country:

North America > United States (0.04)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)
North America > Canada > Ontario > Toronto (0.04)
(4 more...)

Genre: Research Report (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Sever: A Robust Meta-Algorithm for Stochastic Optimization

Diakonikolas, Ilias, Kamath, Gautam, Kane, Daniel M., Li, Jerry, Steinhardt, Jacob, Stewart, Alistair

arXiv.org Machine LearningMar-7-2018

In high dimensions, most machine learning methods are brittle to even a small fraction of structured outliers. To address this, we introduce a new meta-algorithm that can take in a base learner such as least squares or stochastic gradient descent, and harden the learner to be resistant to outliers. Our method, Sever, possesses strong theoretical guarantees yet is also highly scalable--beyond running the base learner itself, it only requires computing the top singular vector of a certain n d matrix. We apply Sever on a drug design dataset and a spam classification dataset, and find that in both cases it has substantially greater robustness than several baselines. On the spam dataset, with 1% corruptions, we achieved 7.4% test error, compared to 13.4% 20.5% for the baselines, and 3% error on the uncorrupted dataset. Similarly, on the drug design dataset, with 10% corruptions, we achieved 1.42 mean-squared error test error, compared to 1.51-2.33

algorithm, artificial intelligence, machine learning, (19 more...)

arXiv.org Machine Learning

1803.02815

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (0.83)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)

Add feedback

Quantifying and Improving the Robustness of Trust Systems

Wang, Dongxia (Nanyang Technological University)

AAAI ConferencesJul-15-2015

Trust systems are widely used to facilitate interactions among agents based on trust evaluation. These systems may have robustness issues, that is, they are affected by various attacks. Designers of trust systems propose methods to defend against these attacks. However, they typically verify the robustness of their defense mechanisms (or trust models) only under specific attacks. This raises problems: first, the robustness of their models is not guaranteed as they do not consider all attacks. Second, the comparison between two trust models depends on the choice of specific attacks, introducing bias. We propose to quantify the strength of attacks, and to quantify the robustness of trust systems based on the strength of the attacks it can resist.Our quantification is based on information theory, and provides designers of trust systems a fair measurement of the robustness.

rating attack, trust system, unfair rating attack, (12 more...)

AAAI Conferences

Twenty-Fourth International Joint Conference on Artificial Intelligence

Country: Asia > Singapore (0.05)

Industry: Information Technology > Security & Privacy (0.70)

Technology:

Information Technology > Security & Privacy (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

Add feedback

Quantifying Robustness of Trust Systems against Collusive Unfair Rating Attacks Using Information Theory

Wang, Dongxia (Nanyang Technological University) | Muller, Tim (Nanyang Technological University) | Zhang, Jie (Nanyang Technological University) | Liu, Yang (Nanyang Technological University)

AAAI ConferencesJul-15-2015

Unfair rating attacks happen in existing trust and reputation systems, lowering the quality of the systems. There exists a formal model that measures the maximum impact of independent attackers [Wang et al., 2015] — based on information theory. We improve on these results in multiple ways: (1) we alter the methodology to be able to reason about colluding attackers as well, and (2) we extend the method to be able to measure the strength of any attacks (rather than just the strongest attack). Using (1), we identify the strongest collusion attacks, helping construct robust trust system. Using (2), we identify the strength of (classes of) attacks that we found in the literature. Based on this, we help to overcome a shortcoming of current research into collusion-resistance — specific (types of) attacks are used in simulations, disallowing direct comparisons between analyses of systems.

attacker, information leakage, strongest attack, (13 more...)

AAAI Conferences

Twenty-Fourth International Joint Conference on Artificial Intelligence

Country:

Asia > Singapore (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry:

Government (0.68)
Information Technology > Security & Privacy (0.47)
Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.89)
Information Technology > Information Management (0.85)

Add feedback